Microsoft Certification AI-102

Azure AI Engineer Associate
Complete Study Course

A comprehensive, exam-aligned interactive course covering all 6 domains of the AI-102 exam. Work through every topic, then test yourself with 50 realistic practice questions.

6
Exam Domains
50
Practice Questions
700
Passing Score /1000
100m
Exam Duration
Dec 2025
Objectives Revision

📋
Exam Domain Weights (2025–2026)

20–25%
Domain 01
Plan & Manage Azure AI
Foundry Services, deployment, monitoring, security, responsible AI
15–20%
Domain 02
Generative AI Solutions
Azure Foundry, OpenAI, RAG, prompt engineering, fine-tuning
5–10%
Domain 03
Agentic AI Solutions
Custom agents, multi-agent orchestration, Microsoft Agent Framework
10–15%
Domain 04
Computer Vision
Image analysis, OCR, custom vision, video indexing, spatial analysis
15–20%
Domain 05
Natural Language Processing
Text analytics, translation, speech, custom language models, QnA
15–20%
Domain 06
Knowledge Mining
Azure AI Search, Document Intelligence, Content Understanding

🎯
Exam Quick Facts

Passing Score700 / 1000 (70%)
Duration100 minutes
Questions~40–60 items (MCQ, case studies, labs)
Code LanguagePython or C# — choose at start, cannot change
Validity12 months — free annual renewal via MS Learn
Retake PolicyWait 24 hours after first fail; varies for subsequent
Price~$165 USD (varies by region)
Exam Focus (2025)Product knowledge, service limits, GenAI, Foundry, Responsible AI

📚
Recommended Study Path

  1. Complete all 6 domain modules in order (click each in the sidebar)
  2. Use the code examples to practice with Azure SDKs (Python preferred)
  3. Do hands-on labs in Azure portal — create real resources
  4. Take the 50-question mock exam — aim for ≥85% before booking
  5. Review wrong answers — each has a full explanation
  6. Revisit the official Microsoft Learn paths for weak areas
  7. Book the exam via Pearson VUE at least 1 week out
⚠ Key Exam Focus (Jan 2026 insight)
The current exam emphasizes product knowledge — service capabilities, file type limits, API names — over architecture. Know Foundry Hub vs Project, content safety filters, specific service features by name.
Domain 1 · Plan & Manage Azure AI

1.1 — Microsoft Foundry Services & Selection

~25 min · 20–25% of exam

🏗
Azure AI Foundry Overview

Azure AI Foundry (formerly Azure AI Studio) is Microsoft's unified platform for building, deploying, and managing enterprise AI solutions. It organizes work into Hubs and Projects.

ConceptRoleScope
Foundry HubEnterprise-level container; manages shared resources, security, network configOrganization-wide
Foundry ProjectTeam/workload container; inherits Hub security; where work actually happensTeam / solution
Foundry Agent ServiceManaged service for creating and deploying AI agentsProject-level
ℹ Exam note
Foundry Hubs are evaluated for network security configuration; Projects are evaluated for model deployment and prompt flow. Know which resources are managed at which level.

🔧
Selecting the Right Azure AI Service

You must match a business scenario to the correct Azure AI service family. This is heavily tested.

RequirementService to SelectNotes
Generate text, code, images from LLMAzure OpenAI in FoundryGPT-4o, DALL-E, Whisper
Analyze images, detect objectsAzure Vision in Foundry ToolsFormerly Computer Vision / Custom Vision
Speech-to-text, text-to-speechAzure Speech in Foundry ToolsFormerly Azure Cognitive Speech
Translate text/documentsAzure Translator in Foundry ToolsCustom translator available
Extract key phrases, entities, sentimentAzure Language in Foundry ToolsFormerly Text Analytics
Extract data from forms, invoices, IDsAzure Document IntelligenceFormerly Form Recognizer
Full-text + vector search over docsAzure AI SearchFormerly Cognitive Search
Index and search video contentAzure AI Video IndexerExtract insights from video/audio
Moderate content in images/textAzure AI Content SafetyPart of Responsible AI tooling
Understand docs, images, video, audioAzure Content UnderstandingNew multimodal service in Foundry Tools

Creating & Deploying AI Services

Azure AI resources can be deployed as multi-service resources or single-service resources.

  • Multi-service resource: Single endpoint and key for multiple AI services — simpler, fewer resources to manage.
  • Single-service resource: Dedicated resource per service — better isolation, service-specific tiers and limits.
  • Resources require a resource group, region, pricing tier, and name.
  • Some services (e.g., OpenAI) require separate approved subscriptions.
# Create Azure AI multi-service resource with Azure CLI az cognitiveservices account create \ --name my-ai-resource \ --resource-group my-rg \ --kind CognitiveServices \ --sku S0 \ --location eastus \ --yes
💡 Deployment tip
Containers: Most Azure AI services support container deployment for edge/local use. Pull from MCR (Microsoft Container Registry). Containers still require connectivity to Azure for billing/metering unless using "Disconnected Containers".

🔑
Authentication Methods

MethodWhen to UseSecurity Level
API Key (Ocp-Apim-Subscription-Key)Simple apps, quick testingMedium
Microsoft Entra ID (AAD)Enterprise, production, RBAC requiredHigh
Managed IdentityAzure-hosted apps, no credentials to manageHigh
SAS TokenStorage access, time-limited accessMedium-High
⚠ Key Management
Always store keys in Azure Key Vault, not in code or config files. Use key rotation — services have 2 keys so you can rotate without downtime. Never embed keys in source code.
Domain 1 · Plan & Manage Azure AI

1.2 — Deploy, Monitor & Secure Services

~20 min · 20–25% of exam

📊
Monitoring Azure AI Resources

Azure AI services integrate with Azure Monitor for metrics, logs, and alerts.

  • Metrics: Request count, latency, error rate, throttled calls — available via Azure Monitor.
  • Diagnostic Logs: Must be explicitly enabled. Send to Log Analytics, Storage Account, or Event Hub.
  • Log Analytics Workspace: Use KQL (Kusto Query Language) to query logs.
  • Alerts: Configure metric alerts for anomaly detection (e.g., error rate > 5%).
  • Azure AI Foundry: Model monitoring for generative AI — monitors performance, safety, resource consumption.
// Example: Query AI service errors in Log Analytics (KQL) AzureDiagnostics | where ResourceType == "COGNITIVESERVICES" | where ResultType == "Failed" | summarize ErrorCount = count() by bin(TimeGenerated, 1h) | render timechart

🔒
Network Security for Foundry Services

  • Public endpoint: Default — accessible from internet (use firewall rules + key auth).
  • Virtual Network (VNet) Service Endpoint: Restricts access to specific VNet subnets.
  • Private Endpoint: Private IP in your VNet — no traffic over internet; best for enterprise.
  • IP Firewall Rules: Allowlist specific IP ranges for public endpoint.
  • Foundry Hub networking: Configure at Hub level — Projects inherit the network configuration.
ℹ Exam focus
Know when to use Private Endpoint vs Service Endpoint. Private Endpoint is the most secure option and is commonly tested in scenario questions about enterprise data privacy.

💰
Managing Costs

  • Pricing tiers: Free (F0), Standard (S0), and premium tiers — F0 has usage limits (typically 5k calls/month).
  • Token-based billing: Azure OpenAI bills per 1K input and output tokens separately.
  • PTU (Provisioned Throughput Units): Reserved capacity for Azure OpenAI — predictable latency and cost.
  • Azure Cost Management: Use budgets and alerts to prevent overspend.
  • Content caching: Prompt caching can reduce OpenAI costs for repeated prompts.

🔄
CI/CD Integration

Integrate Azure AI resources into automated pipelines using:

  • Azure DevOps / GitHub Actions: Deploy AI resources via ARM templates, Bicep, or Terraform.
  • Azure AI Foundry SDK: Use azure-ai-projects Python SDK to automate model deployment in pipelines.
  • Model versioning: Track model versions; use deployment slots for staged rollouts.
  • Prompt Flow: CI/CD for LLM applications — version, test, and deploy prompt flows as endpoints.
# Example: Deploy Azure AI resource using Bicep in CI/CD # bicep template call in GitHub Actions az deployment group create \ --resource-group $RESOURCE_GROUP \ --template-file ai-resource.bicep \ --parameters location=eastus tier=S0
Domain 1 · Plan & Manage Azure AI

1.3 — Responsible AI

~20 min · 20–25% of exam

Microsoft's 6 Responsible AI Principles

Fairness
AI treats all people equitably, avoiding biased outcomes
Reliability & Safety
Performs as designed, safe in unexpected conditions
Privacy & Security
Respects data privacy and protects user information
Inclusiveness
Empowers everyone regardless of ability or background
Transparency
Understandable and explainable AI decisions
Accountability
Humans are accountable for AI decisions and outcomes

🛡
Azure AI Content Safety Features

FeatureWhat It DoesUse Case
Content FiltersBlock harmful categories (hate, sexual, violence, self-harm) in LLM I/OAll GenAI deployments
Prompt ShieldsDetect jailbreak attempts and prompt injection attacksProtect system prompts
Groundedness DetectionDetect hallucinations — ungrounded claims in RAG responsesRAG/QA systems
BlocklistsCustom word/phrase lists to block specific contentBrand protection, compliance
Harm DetectionDetect self-harm, extremist content, material endangering safetyConsumer apps
Content Moderation APIModerate text and images for safe categoriesUGC platforms

📋
Responsible AI Governance Framework

Designing a governance framework involves:

  1. Impact Assessment: Identify potential harms before deployment
  2. Model Cards: Document model capabilities, limitations, and intended uses
  3. Human-in-the-loop: Define when human review is required
  4. Monitoring: Continuously monitor for fairness drift and harmful outputs
  5. Incident Response: Define escalation path for AI failures
  6. Data Governance: Document training data sources and consent
💡 Exam tip
Responsible AI is integrated throughout all Azure AI services — not just a standalone feature. Content filters in Azure OpenAI are enabled by default and cannot be fully disabled without explicit Microsoft approval.
Domain 2 · Generative AI

2.1 — Building GenAI Solutions with Azure Foundry

~30 min · 15–20% of exam

🏗
Azure AI Foundry Setup

To build a GenAI solution in Foundry:

  1. Create a Foundry Hub (top-level, manages shared infra + security)
  2. Create a Foundry Project under the Hub (workspace for the solution)
  3. Deploy a model within the project (GPT-4o, Llama 3, etc.)
  4. Build with Prompt Flow or SDK
  5. Evaluate with built-in evaluators
  6. Deploy as a managed online endpoint

🔄
RAG — Retrieval Augmented Generation

RAG grounds model responses in your private data. Key components:

ComponentAzure ServiceRole
Embedding Modeltext-embedding-ada-002Convert text to vectors
Vector StoreAzure AI SearchStore and search vectors
RetrieverAI Search (hybrid)Find relevant chunks
GeneratorAzure OpenAI (GPT-4o)Generate grounded answer
OrchestrationPrompt Flow / LangChainConnect pipeline
ℹ RAG Pattern in Foundry
In Azure AI Foundry, use "Add your data" to connect Azure Blob Storage or AI Search to a playground/deployment. This automatically configures the retrieval pipeline.

📊
Prompt Flow

Prompt Flow is an LLM-ops tool for building, testing, and deploying LLM applications.

  • Flow types: Standard (general), Chat (multi-turn), Evaluation (scoring)
  • Nodes: LLM, Python, Prompt, Tool — connected as a DAG
  • Variants: Test different prompt versions in parallel
  • Evaluators: Built-in metrics — groundedness, relevance, coherence, fluency, similarity
  • Deployment: Deploy flows as REST endpoints with auto-scaling
# Prompt Flow YAML node (LLM node example) name: answer_question type: llm source: type: code path: answer_prompt.jinja2 inputs: context: ${retrieve_docs.output} question: ${inputs.question} connection: my-gpt4o-connection api: chat

📐
Prompt Engineering Techniques

TechniqueDescriptionBest For
Zero-shotNo examples; rely on model knowledgeSimple, well-defined tasks
Few-shotProvide 2–5 examples in the promptPattern-following tasks
Chain-of-Thought"Think step by step" — intermediate reasoningMath, logic, multi-step problems
System MessageDefine persona, constraints, output formatAll chat applications
Meta PromptingAsk model to generate its own prompt structureComplex workflows
Retrieval AugmentationInject retrieved context into promptQA over private data
Domain 2 · Generative AI

2.2 — Azure OpenAI Service & Models

~25 min · 15–20% of exam

🤖
Azure OpenAI Models

ModelCapabilityKey Use
GPT-4oMultimodal: text, image, audio inputComplex reasoning, vision tasks
GPT-4o miniFast, cost-efficient version of GPT-4oHigh-volume, lower-cost use cases
o1 / o3Reasoning models — extended thinkingMath, science, code reasoning
text-embedding-3Generate text embeddings for semantic searchRAG, semantic similarity
DALL-E 3Generate images from text promptsCreative content, image generation
WhisperSpeech-to-text transcriptionAudio transcription, subtitles
TTSText-to-speech (via OpenAI endpoint)Spoken responses

Key Parameters for Generation Control

ParameterRangeEffect
temperature0.0 – 2.0Randomness — 0=deterministic, 1=creative, 2=chaotic
top_p0.0 – 1.0Nucleus sampling — prefer over temperature; 0.9 = top 90% likely tokens
max_tokens1 – model maxMaximum output tokens (affects cost)
frequency_penalty-2.0 – 2.0Reduce repetition of frequent tokens
presence_penalty-2.0 – 2.0Encourage new topics in output
stopstring[]Stop generation when sequence is found
seedintegerReproducible outputs (same seed = same result)
⚠ Don't use both
Don't set both temperature and top_p at non-default values simultaneously — use one or the other for controlling randomness.

🖼
DALL-E & Multimodal

DALL-E 3 generates images from text via the Azure OpenAI endpoint:

from openai import AzureOpenAI client = AzureOpenAI(azure_endpoint=endpoint, api_key=key, api_version="2024-02-01") result = client.images.generate( model="dall-e-3", prompt="A futuristic Azure datacenter with glowing blue lights", n=1, size="1024x1024", quality="hd" ) image_url = result.data[0].url

For GPT-4o vision, pass image as base64 or URL in the user message content array with type: "image_url".

🎛
Fine-tuning

Fine-tuning adapts a base model on your domain-specific data:

  • Supported models: GPT-4o mini, GPT-3.5 Turbo (check current availability)
  • Training format: JSONL with {"messages": [...]} pairs
  • Minimum dataset: 10 examples (recommend 50–100+ for good results)
  • When to fine-tune: Consistent style/format, domain terminology, not for adding new knowledge (use RAG instead)
  • Cost: Training cost + higher inference cost vs base model
Domain 2 · Generative AI

2.3 — Optimize & Operationalize GenAI

~20 min · 15–20% of exam

📈
Model Evaluation Metrics

MetricWhat It MeasuresScale
GroundednessAre claims in the response supported by the retrieved context?1–5
RelevanceHow relevant is the response to the question?1–5
CoherenceIs the response logically consistent and well-structured?1–5
FluencyIs the response grammatically correct and readable?1–5
F1 ScoreToken overlap between generated and ground truth answers0–1
SimilaritySemantic similarity to ground truth0–1

🔍
Tracing & Feedback

  • Azure AI Foundry tracing: Track each step in a prompt flow — inputs, outputs, latency per node.
  • OpenTelemetry integration: Export traces to Azure Monitor / Application Insights.
  • Feedback collection: Collect thumbs up/down ratings; feed back into evaluation datasets.
  • Model reflection: Use a secondary LLM call to self-critique the primary response.

🌊
Deployment & Scalability

  • Standard deployment: Pay-per-token, auto-scales to Azure capacity.
  • PTU deployment: Provisioned Throughput Units — reserved compute, consistent latency, hourly billing.
  • Container deployment: Deploy Foundry models to ACI, AKS, or edge (IoT) using containers.
  • Multi-model orchestration: Route different request types to specialized models (e.g., GPT-4o for complex, GPT-4o mini for simple).
Domain 3 · Agentic AI

3.1 — Building Custom Agents

~25 min · 5–10% of exam

🤖
What is an AI Agent?

An AI agent is an autonomous system that perceives its environment, makes decisions, and takes actions to achieve goals — using LLMs as the reasoning engine.

  • Core loop: Perceive → Reason → Act (ReAct pattern)
  • Tools: Agents call external tools/APIs — code interpreter, search, databases, custom APIs
  • Memory: Short-term (conversation context), long-term (vector stores), episodic
  • Planning: Break complex tasks into sub-tasks; execute step-by-step
  • Reflection: Self-evaluate outputs and retry if needed

🏗
Microsoft Foundry Agent Service

The Azure AI Foundry Agent Service provides a managed platform for building and deploying agents:

  • Create agents via Foundry portal or SDK with instructions, model, tools, and thread management.
  • Built-in tools: Code Interpreter, File Search (RAG), Function Calling, Azure AI Search, Bing Search.
  • Threads: Manage conversation state; messages persist in thread context.
  • Runs: Execute agent on a thread; poll for completion or use streaming.
from azure.ai.projects import AIProjectClient client = AIProjectClient.from_connection_string(conn_str=conn_str) agent = client.agents.create_agent( model="gpt-4o", name="my-agent", instructions="You are a helpful data analyst. Use code to solve problems.", tools=[{"type": "code_interpreter"}] ) thread = client.agents.create_thread() client.agents.create_message(thread.id, role="user", content="Analyze this dataset") run = client.agents.create_and_process_run(thread.id, agent.id)

🔗
Multi-Agent Orchestration

  • Microsoft Agent Framework (Semantic Kernel / AutoGen): Build complex workflows with multiple specialized agents.
  • Orchestrator agent: Routes tasks to specialized sub-agents.
  • Sub-agents: Each handles a specific domain (e.g., SQL agent, document agent, search agent).
  • Autonomous capabilities: Agents can trigger other agents; use event-driven patterns.
  • Testing: Test agents with diverse scenarios; evaluate tool use accuracy and goal completion.
⚠ Exam scope
The agentic domain (5–10%) focuses on knowing when to use agents, what Foundry Agent Service provides, and the concept of multi-agent orchestration — not deep framework internals.
Domain 4 · Computer Vision

4.1 — Image Analysis & OCR

~25 min · 10–15% of exam

🖼
Azure Vision in Foundry Tools — Capabilities

FeatureWhat It ReturnsAPI Parameter
TagsDescriptive labels (e.g., "dog", "outdoor") with confidence scoresvisualFeatures=Tags
ObjectsDetected objects with bounding boxesvisualFeatures=Objects
DescriptionNatural language caption of the imagevisualFeatures=Description
FacesFace detection with bounding boxes (no identification)visualFeatures=Faces
ColorDominant/accent colors, black&white detectionvisualFeatures=Color
Image TypeClip art, line drawing detectionvisualFeatures=ImageType
AdultAdult/racy content scores (0–1)visualFeatures=Adult
Smart CropsSuggested crop regionssmartCrops parameter

📄
OCR & Read API

The Read API (Azure Vision 4.0 Read) extracts text from images and documents:

  • Input formats: JPEG, PNG, BMP, TIFF, PDF — max 50 MB, max 10,000 pages
  • Languages: 164+ languages supported
  • Handwriting: Detect and extract handwritten text
  • Response: Returns text lines, words, bounding polygons, confidence scores
  • Async operation: For large documents — submit and poll with operation ID
from azure.ai.vision.imageanalysis import ImageAnalysisClient from azure.ai.vision.imageanalysis.models import VisualFeatures client = ImageAnalysisClient(endpoint=endpoint, credential=AzureKeyCredential(key)) result = client.analyze_from_url( image_url="https://example.com/image.jpg", visual_features=[VisualFeatures.READ, VisualFeatures.OBJECTS], language="en" ) for block in result.read.blocks: for line in block.lines: print(line.text)
Domain 4 · Computer Vision

4.2 — Custom Vision Models & Video Analysis

~20 min · 10–15% of exam

🎯
Custom Vision — Classification vs Detection

TypeOutputMin Training ImagesUse Case
Image ClassificationClass label + confidence for whole image5 per class"Is this a cat or dog?"
Object DetectionBounding boxes + labels for multiple objects15 per class"Find all cars in image"

Training workflow: Upload images → Label → Train → Evaluate (Precision/Recall/AP) → Publish → Consume via prediction URL

ℹ Code-first custom models
From Vision 4.0, you can train custom models code-first using the Florence-based model in Azure AI Foundry — no portal required. This is the exam's "Build a custom vision model code first" objective.

🎬
Video Analysis Services

ServiceCapabilitiesUse Case
Azure AI Video IndexerTranscription, speaker diarization, face detection, OCR in video, key frame extraction, content moderation, topic detectionMedia archives, searchable video, compliance
Spatial Analysis (Vision)Count people in zones, detect line crossing, track movement in video streamsRetail occupancy, safety compliance, queue management

Video Indexer insights include: transcript, OCR, keywords, labels, scenes, shots, keyframes, faces, named people, brands, sentiments.

Domain 5 · NLP

5.1 — Text Analytics & Translation

~25 min · 15–20% of exam

📝
Azure AI Language — Text Analytics Features

FeatureOutputNotes
Sentiment AnalysisPositive/Negative/Neutral/Mixed + confidence per sentenceOpinion mining shows aspect-level sentiment
Key Phrase ExtractionMain concepts/phrases from textMax 5,120 chars per document
Entity Recognition (NER)Named entities: Person, Org, Location, DateTime, etc.18 entity categories
Entity LinkingDisambiguate entities with Wikipedia linksUseful for knowledge graphs
PII DetectionPersonal data: SSN, email, phone, credit card, etc.Returns categories and redacted text
Language DetectionISO 639-1 language code + confidenceReturns "unknown" if confidence low
Text SummarizationExtractive or abstractive summaryExtractive: picks existing sentences

🌍
Azure Translator in Foundry Tools

  • Text translation: Translate text between 100+ languages in a single API call
  • Document translation: Translate entire documents while preserving layout — async batch operation
  • Transliteration: Convert script without translating (e.g., Arabic → Latin letters)
  • Language detection: Auto-detect source language
  • Dictionary lookup: Find alternative translations and word usage examples
  • Custom Translator: Fine-tune translation for domain-specific terminology (e.g., legal, medical)
import requests, uuid headers = {'Ocp-Apim-Subscription-Key': key, 'Content-type': 'application/json', 'X-ClientTraceId': str(uuid.uuid4())} body = [{'text': 'Hello, how are you?'}] r = requests.post( 'https://api.cognitive.microsofttranslator.com/translate?api-version=3.0&to=fr,de', headers=headers, json=body ) # Returns translations in French and German
Domain 5 · NLP

5.2 — Speech Services

~25 min · 15–20% of exam

🎙
Azure Speech in Foundry Tools

CapabilityDescriptionKey Config
Speech-to-Text (STT)Real-time or batch transcription from audioLanguage, audio format, diarization
Text-to-Speech (TTS)Convert text to natural speechVoice name, language, SSML
Speech TranslationReal-time speech translation to multiple targetsSource/target languages
Custom SpeechAdapt STT for domain vocabulary/accentsTraining data, acoustic model
Custom Neural VoiceCreate brand-specific TTS voiceRequires speaker consent + recording
Intent RecognitionIntegrate with LUIS/CLU for intent extraction from speechLUIS app ID required
Keyword RecognitionAlways-on wake-word detectionKeyword model file (.table)

🎨
SSML — Speech Synthesis Markup Language

SSML allows fine-grained control over TTS output:

<speak version="1.0" xml:lang="en-US"> <voice name="en-US-AriaNeural"> Welcome to <emphasis level="strong">Azure AI</emphasis>. <break time="500ms"/> <prosody rate="slow" pitch="+5%"> This is a demonstration of SSML control. </prosody> </voice> </speak>

SSML elements: <voice> (select voice), <prosody> (rate/pitch/volume), <break> (pause), <emphasis> (stress), <say-as> (dates, ordinals), <phoneme> (pronunciation).

🔊
Audio Formats for Speech

DirectionSupported FormatsRecommended
Input (STT)WAV (PCM 16-bit), MP3, OGG, FLAC, AMR, WebMWAV PCM 16kHz 16-bit mono
Output (TTS)WAV, MP3, OGG, RAW, RIFFMP3 for streaming, WAV for quality
Domain 5 · NLP

5.3 — Custom Language Models & QnA

~25 min · 15–20% of exam

🧠
Conversational Language Understanding (CLU)

CLU (successor to LUIS) builds custom intent/entity recognition models:

ComponentDescriptionExample
IntentThe action the user wants to perform"BookFlight", "CancelOrder"
EntityKey information extracted from utteranceCity, Date, ProductName
UtteranceSample user input used for training"Book a flight to Paris tomorrow"
None intentCatch-all for out-of-scope inputs

Workflow: Create project → Add intents/entities → Add utterances → Train → Evaluate (F1/Precision/Recall) → Deploy → Consume

⚠ Backup & Recovery
Export CLU model as JSON for backup. Import to restore. Models can be copied between projects using the export/import feature.

Custom Question Answering (QnA)

Build FAQ-style knowledge bases that answer questions from documents:

  • Sources: URLs (FAQ pages), files (PDF, DOCX, TXT, TSV), manual Q&A pairs
  • Multi-turn conversations: Add follow-up prompts to create guided conversation flows
  • Chit-chat: Add personality responses (Professional, Friendly, Witty, etc.)
  • Alternate phrasings: Add variant ways to ask the same question
  • Active learning: System suggests improvements based on low-confidence answers
  • Multi-language: Set project language or use per-document language detection
  • Export: Export as TSV or JSON for backup/migration

🌐
Custom Translator

  • Train custom translation models for domain-specific terminology
  • Training data: Parallel sentence pairs (source + target), bilingual documents
  • Min training data: 1,000 sentence pairs (recommend 10,000+)
  • Publish to custom category; call via category parameter in Translator API
  • Iterative improvement: Add more data, retrain, compare BLEU scores
Domain 6 · Knowledge Mining

6.1 — Azure AI Search

~30 min · 15–20% of exam

🔍
Core Architecture

Azure AI Search (formerly Cognitive Search) has 4 main components:

ComponentRoleKey Concepts
Data SourceConnection to raw data (Azure Blob, SQL, Cosmos DB, ADLS)Connection string, container/table name
IndexSchema defining searchable fields and their propertiesSearchable, Filterable, Sortable, Facetable, Retrievable
SkillsetAI enrichment pipeline applied during indexingBuilt-in + custom skills, knowledge store
IndexerOrchestrates the pipeline — reads source, applies skills, writes to indexSchedule, field mappings, output field mappings

Built-in Cognitive Skills

SkillInputOutput
OCR SkillImagetext, layoutText
Image Analysis SkillImagetags, description, objects
Entity Recognition Skilltextpersons, organizations, locations, etc.
Key Phrase SkilltextkeyPhrases
Sentiment Skilltextscore, label
Language Detection SkilltextlanguageCode, languageName
Split SkilltexttextItems (chunks)
Merge Skillmultiple stringsmergedText
Shaper Skillmultiple fieldscomplex shaped object
Azure OpenAI Embedding Skilltextvector embedding

📦
Custom Skills

Extend skillsets with custom logic:

  • Custom Web API skill: Call any HTTPS endpoint that accepts the skill contract (JSON in/out)
  • Azure Function skill: Easiest pattern — deploy Azure Function, register as custom skill
  • Azure ML skill: Call Azure ML endpoint for custom model inference
  • Input/Output schema: Must match skillset contract — array of values with document key

🔎
Query Types & Syntax

Query TypeSyntaxUse Case
SimpleDefault — keyword matching with +/- operatorsBasic search
Full LucenequeryType=full — wildcards, regex, fuzzy (~), proximity, boostingAdvanced text search
SemanticqueryType=semantic — AI re-ranking + captions/answersNatural language QA
Vectorvector field + nearest neighbor — cosine similaritySemantic similarity search
HybridFull-text + vector in single query — RRF fusion scoringBest of both — RAG pipelines
ℹ Knowledge Store
Knowledge Store projects enriched data to Azure Storage as table projections (structured), object projections (JSON files), or file projections (images/files). Useful for downstream BI and analysis.
Domain 6 · Knowledge Mining

6.2 — Azure Document Intelligence

~25 min · 15–20% of exam

📋
Prebuilt Models

ModelExtractsKey Fields
InvoiceVendor, customer, line items, totalsInvoiceId, VendorName, TotalTax, DueDate
ReceiptMerchant, transaction, items, totalsMerchantName, TransactionDate, Total
ID DocumentPassport, driver's license fieldsFirstName, LastName, DOB, DocumentNumber
Business CardContact informationContactNames, Emails, PhoneNumbers
W2US tax form fieldsEmployee, Employer, Wages, FederalTax
ReadText extraction only (no structure)content, pages, lines, words
LayoutText + structure (tables, selections)paragraphs, tables, selectionMarks
General DocumentKey-value pairs + layoutkeyValuePairs, entities

🎨
Custom Models

TypeWhen to UseMin Training Docs
Custom TemplateFixed-layout documents (forms with consistent structure)5 labeled samples
Custom NeuralVariable-layout documents (diverse formats)10 labeled samples
Composed ModelRoute between multiple custom models based on document typeN/A (combines existing models)

Labeling workflow: Upload samples to Azure Blob → Label in Document Intelligence Studio → Train → Evaluate (F1/accuracy per field) → Publish → Consume

ℹ Composed models
A composed model takes multiple custom models and automatically routes documents to the best model. Useful when you have documents from multiple templates/sources. The response includes a docType field indicating which sub-model was selected.

📁
Supported Formats & Limits

PropertyValue
File formatsPDF, JPEG, PNG, BMP, TIFF, HEIF (modern models)
Max file size500 MB per file
Max pages2,000 pages per request (standard)
Min image dimensions50 × 50 pixels
Max image dimensions10,000 × 10,000 pixels
Domain 6 · Knowledge Mining

6.3 — Azure Content Understanding

~15 min · 15–20% of exam

🌐
Azure Content Understanding Overview

Azure Content Understanding (in Foundry Tools) is a new multimodal extraction service that goes beyond Document Intelligence — it processes documents, images, videos, and audio to extract structured insights.

CapabilityDescription
OCR PipelineExtract text from images and documents — high accuracy with layout preservation
Document UnderstandingSummarize, classify, detect attributes, extract entities/tables/images from documents
Video ProcessingFrame analysis, scene detection, transcription, visual grounding in video
Audio ProcessingTranscription, speaker diarization, sentiment from audio
Field ExtractionSchema-based extraction — define fields you want and it extracts them from any content type
💡 Content Understanding vs Document Intelligence
Document Intelligence is specialized for structured/semi-structured documents (invoices, forms). Content Understanding handles a broader set of content types including video and audio and supports more flexible multimodal analysis. Use DI for specific form types; use CU for diverse content pipelines.
AI-102 Mock Exam — 50 Questions
Covers all 6 domains · Passing score: 70%
Score
Answered
0/50